Hypergraph factorization for multi-tissue gene expression imputation

نویسندگان

چکیده

Abstract Integrating gene expression across tissues and cell types is crucial for understanding the coordinated biological mechanisms that drive disease characterize homoeostasis. However, traditional multi-tissue integration methods either cannot handle uncollected or rely on genotype information, which often unavailable subject to privacy concerns. Here we present HYFA (hypergraph factorization), a parameter-efficient graph representation learning approach joint imputation of cell-type expression. agnostic, supports variable number collected per individual, imposes strong inductive biases leverage shared regulatory architecture genes. In performance comparison Genotype–Tissue Expression project data, achieves superior over existing methods, especially when multiple reference are available. The HYFA-imputed dataset can be used identify replicable genetic variations (expression quantitative trait loci), with substantial gains original incomplete dataset. accelerate effective scalable tissue transcriptome biorepositories.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Matrices Factorization with Application to Missing Sensor Data Imputation

We formulate a multi-matrices factorization model (MMF) for the missing sensor data estimation problem. The estimation problem is adequately transformed into a matrix completion one. With MMF, an n-by-t real matrix, R, is adopted to represent the data collected by mobile sensors from n areas at the time, T1, T2, ..., Tt, where the entry, Rij, is the aggregate value of the data collected in the ...

متن کامل

HDTD: analyzing multi-tissue gene expression data

MOTIVATION By collecting multiple samples per subject, researchers can characterize intra-subject variation using physiologically relevant measurements such as gene expression profiling. This can yield important insights into fundamental biological questions ranging from cell type identity to tumour development. For each subject, the data measurements can be written as a matrix with the differe...

متن کامل

Hypergraph-Based Unsymmetric Nested Dissection Ordering for Sparse LU Factorization

In this paper we present HUND, a hypergraph-based unsymmetric nested dissection ordering algorithm for reducing the fill-in incurred during Gaussian elimination. HUND has several important properties. It takes a global perspective of the entire matrix, as opposed to local heuristics. It takes into account the assymetry of the input matrix by using a hypergraph to represent its structure. It is ...

متن کامل

A multi-stage approach to clustering and imputation of gene expression profiles

MOTIVATION Microarray experiments have revolutionized the study of gene expression with their ability to generate large amounts of data. This article describes an alternative to existing approaches to clustering of gene expression profiles; the key idea is to cluster in stages using a hierarchy of distance measures. This method is motivated by the way in which the human mind sorts and so groups...

متن کامل

Multi - Objective Hypergraph

In this paper we present a family of multi-objective hypergraph partitioning algorithms based on the multilevel paradigm, which are capable of producing solutions in which both the cut and the maximum subdomain degree are simultaneously minimized. This type of partitionings are critical for existing and emerging applications in VLSI CAD as they allow to both minimize and evenly distribute the i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Nature Machine Intelligence

سال: 2023

ISSN: ['2522-5839']

DOI: https://doi.org/10.1038/s42256-023-00684-8